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Abstract. We consider the application of fluctuation relations to the dynamics 
of coarse-grained systems, as might arise in a hypothetical experiment in which 
a system is monitored with a low-resolution measuring apparatus. We analyze 
a stochastic, Markovian jump process with a specific structure that lends itself 
naturally to coarse-graining. A perturbative analysis yields a reduced stochastic 
jump process that approximates the coarse-grained dynamics of the original 
system. This leads to a non-trivial fluctuation relation that is approximately 
satisfied by the coarse-grained dynamics. We illustrate our results by computing 
the large deviations of a particular stochastic jump process. Our results highlight 
the possibility that observed deviations from fluctuation relations might be due 
to the presence of unobserved degrees of freedom. 
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1. Introduction 

Coarse-graining is at the heart of equihbrium statistical mechanics. It is used to 
describe systems with a macroscopic number of degrees of freedom with the help of a 
much smaller number of variables. A familiar example is the description of a gas in 
terms of its slowly varying local density rather than the position of all its molecules [1] . 
Coarse-graining is particularly important in the study of phase transitions, and more 
generally, interacting systems. It is a crucial building block in renormalization group 
theory [51 [3]. Even far from a phase transition the combination of coarse-graining 
with the identification of relevant and irrelevant variables can sometimes be used to 
replace seemingly complicated interactions by (approximate) simpler ones. 

The theory of systems out of equilibrium is less developed and, in particular, 
the possible uses of coarse-graining in such a theory are not as clear. One approach, 
recently reviewed by Derrida [4] , uses large deviation theory to address the fluctuations 
of a local density or some other coarse-grained field. This leads to a functional of the 
local density that characterizes the steady state and fluctuations around it. 

In recent years, a different approach has been employed to study systems far from 
equilibrium. In this approach one investigates symmetries, related to time-reversal, 
that are associated with the probabilities of observing sequences of events. Such 
investigations have led to the discovery of a number of interesting results, collectively 
known as fluctuation relations, including some that provide information related to the 
likelihood to observe so-called "second law violations" [SJ [HI [I]. Loosely speaking, 
fluctuation relations have the form 
' PjAS = qT) ' 
_P{AS = -qT)_ 

where P{AS) is the probability to observe a change AS* in entropy, T is the observation 
time, and q is the observed entropy creation rate. [The notation o(T) indicates a 
correction that grows more slowly than T, that is limr^oo o{T)/T = 0.] These results 
are quite general, they remain valid far from thermal equilibrium, and they are part 
of an active field of research focused on the application of the laws of thermodynamics 
to microscopic systems [5]. 

Fluctuation relations are derived by considering statistical distributions of 
microscopic trajectories of the system of interest. For any such trajectory, we assign 
a value to a quantity AS, which we interpret physically as a measure of entropy 
production. By comparing the probability of observing a given trajectory (7) with 
that of its time-reversed counterpart (7), the symmetry represented by equation ([1]) 
emerges (see section |3]). This approach, however, runs into difficulties when coarse- 
graining is involved. Consider an experiment in which some degrees of freedom cannot 
be observed, perhaps because they evolve on time scales much faster than the response 
time of the detector. In this case our observation of the system gives us a coarse-grained 
- i.e. a "smeared", or locally averaged - trajectory, rather than a full microscopic 
record of the evolving state of the system. It is not at all evident that such coarse- 
grained trajectories satisfy the same symmetry relations as their fully microscopic 
counterparts. Moreover, important information might be lost due to coarse graining; 
specifically, the expression for entropy production, AS", might contain contributions 
from those degrees of freedom that are not resolved by the detector. Under these 
circumstances, can we define some sort of "coarse-grained entropy production" , AS"^*^, 
which satisfies a fluctuation relation? 
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Our goal in this paper is to give partial answers to such questions. We will examine 
a simple system for which an assumed separation of time scales leads to a simple 
coarse-graining transformation. The system we study is a stochastic jump process 
whose probability distribution satisfies a Master equation. Fluctuation relations for 
such jump processes were first derived by Lebowitz and Spohn , following Kurchan's 
analysis of Langevin processes jlO] . These two papers laid the groundwork for much 
subsequent work on stochastic fluctuation relations, which has recently been reviewed 
by Harris and Schiitz [11] . In the present paper, we impose a separation of time scales 
and argue that, when the slow component of the dynamics is accurately measured, a 
non-trivial fluctuation relation, which differs from the one valid for the microscopic 
system, is (approximately) satisfied. This relation involves a coarse-grained entropy 
production, which depends only on the slow component of the evolution. 

We note that in the context of fluctuation theorems, an explicit separation of time 
scales has appeared as well in the work of Zamponi et.al. [T^- These authors have 
studied a Langevin particle interacting with a nonequilibrium thermal environment 
consisting of slow and fast components at different temperatures (see also [13]). For 
such a system a fluctuation-dissipation relation between spontaneous and externally 
induced fluctuations can be used to define an effective temperature. The system was 
shown to satisfy a generalized fluctuation theorem formulated in terms of this effective 
temperature. 

In section [2] we will present the model used and apply a coarse-graining 
transformation, replacing it by a simpler system. The (approximate) coarse-grained 
system is also Markovian and therefore satisfies a fluctuation relation of its own. In 
section [3] we use the existence of a coarse-grained counterpart to define coarse-grained 
trajectories, and a coarse-grained entropy production A5'*^'^. These trajectories can be 
heuristically viewed as measurements obtained with a limited- resolution detector. We 
show that the coarse-grained entropy production approximately satisfies a fluctuation 
relation (for the original system). A simple example is numerically studied in section 
|4j in order to illuminate the results obtained in previous sections. The results are 
summarized in section [5] 

2. Stochastic systems with simple coarse-graining 

While there are many physically relevant and interesting examples of systems with 
natural coarse-grained counterparts, we will focus on stochastic jump processes 
governed by master equations. After briefly reviewing general properties of such 
processes, we present models with special structure, which makes them amenable to 
coarse-graining. We then use simple perturbation theory to motivate a coarse-graining 
transformation that replaces the system of interest by a smaller, reduced system. 

Consider a system composed of a finite number of distinct states. The system 
can make transitions between these states, and these transitions are assumed to be 
described by a Markov process. Specifically, given two states a and a', the rate 
R{a\<7') is the probability per unit time to make a transition to a, from a current 
state a'. While not all transitions are allowed (some of the rates vanish), we do 
assume that if the transition from a' to a is allowed [R{a\a-') ^ 0] then also the 
reverse transition is allowed \R{a'\a) ^ 0]. This assumption is analogous, but not 
equivalent, to microscopic reversibility for deterministic systems. 

It is useful to depict such a system with the help of a graph, in which each node 
represents a state and each link represents a pair of non-vanishing transitions. For 
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Figure 1. A stochastic system described with the help of a graph. The nodes 
correspond to the states of the system. Each Unk represents a non-vanishing 
transition between two states, and its reversed counterpart. The system is fully 
specified when the transition rates are known. 



instance, the graph shown in figure [T] illustrates a five-state system with six non- 
vanishing links. Note that we allow multiple links between a given pair of states, as 
between states 2 and 3. In other words, there may exist information that allows one to 
distinguish between different ways to make a transition. This implies that four rates - 
namely, i?Q=i,2(3|2) and i?Q=i,2(2|3) - are needed to describe all transitions between 
states 2 and 3. 

Such a process satisfies a master equation 

dP{a,t) 



dt 



Y^R{a\o')P{a',t), (2) 



where P(ct, t) denotes the normalized probability to find the system in the state a at 
time t. It is useful to think of P{t) as a vector whose components are labeled by the 
index a, and R as a matrix. A non-diagonal element R{(j\a') is given by the sum 
of the rates from a' to cr, e.g. R{a\a') — 'Yl,a^a{<^W), whereas a diagonal element 
R{a\(j) specifies the net probability rate of transitions out of the state a. To ensure 
conservation of probability, the rates satisfy 

Y.R{a\a')^0. (3) 

a 

We assume that the graph is connected, that is, it is possible to go from any state to 
any other state using the links. In that case, the solution of equation ^ decays with 
time to a unique stationary solution satisfying [Mj 

Y,R{a\a')P\a')^Q. (4) 

The stationary probability current J^{a',a) = Ra{cr\a')P^{a') — Ra{a'\a)P^{a) 
measures the net flow from cr' to ct, via the transition a, in the stationary state. 
If = for every transition, then detailed balance is satisfied, and it is natural to 
think of the system as being in a state of thermal equilibrium. For a generic transition 
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Figure 2. Graph of a stochastic system with a meaningful coarso-graining 
transformation. The solid lines denote links with large transition rates (of order 
unity). The dotted lines denote weakly connected states (with rates of order e). 

matrix R, however, the currents J„ are non- vanishing, and then we consider the system 
to be in an out-of-equihbrium steady state [15] . 

Systems described by master equations are, generaUy, relatively easy to handle 
mathematically. However, our interest is in systems for which it is useful to coarse- 
grain the dynamics. This leads us to examine systems with special structure. 
Specifically, we will assume that the N states of our system are assorted into L clusters, 
with "strong" links within a given cluster, R{a\a') ^ 1, and "weak" links between 
states belonging to different clusters, R{(j\a') ~ e ^ 1, as illustrated in figure [2] for 
iV = 9, L = 3. We further assume that each separate cluster is (internally) connected, 
and also that the entire system is connected. Finally, we assume that there is at most 
one microscopic transition connecting any pair of states in the systerrQ- (It will become 
clear that coarse- graining does not preserve this property.) A system satisfying these 
conditions is characterized by two widely separated time scales, corresponding to the 
fast dynamics of relaxation within each cluster (ry •^1), and the slow redistribution 
of probability among the clusters (r^ ^ e~^). 

The microscopic evolution of our system is fully described by a sequence of 
transitions from state to state, a — > a' — » a" ■ ■ along with the times at which 
these transitions occur. We introduce coarse graining by imagining that we monitor 
this evolution using a low-resolution apparatus, capable of distinguishing different 
clusters, but not different states within a cluster. The dynamics we observe then 
consists of transitions from cluster to cluster, c — » c' — > c" • • •, which provides only 
partial information regarding the underlying microscopic trajectory. In general this 
low- resolution dynamics is not a Markov process, and cannot be described by a master 
equation of the form given by equation ^ . We aim to investigate whether this coarse- 
grained sequence satisfies a meaningful fluctuation theorem. 

Let us imagine, for a moment, that such a coarse-graining transformation maps 
the original system onto a smaller system, whose states correspond to the clusters of 

X The discussion can be generalized to several possible transitions easily. We use this assumption 
since we find the idea of a microscopic level, where the dynamics is simpler, appealing. 
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Figure 3. The structure of possible simple systems, obtained by coarse-graining 
the system depicted in figure [2] 



the original system. For instance, the system depicted in figure [2] is mapped onto a 
3-state system. Note that there may be several difi^erent transitions between a pair 
of clusters. The ability of the low-resolution apparatus to distinguish between these 
transitions determines the structure of the simpler system. For instance, the system 
depicted in figure[2]will be mapped to the one depicted in figure[3^ when the apparatus 
can distinguish between the transitions connecting the same clusters. Alternatively, 
when the apparatus can only distinguish the clusters, but can not be used to identify 
the transitions, the system will be mapped to the one depicted in figure [3Jd. 

In order to give a more quantitative meaning to the notion of coarse-graining, we 
now develop a perturbation theory that maps our original A^-state stochastic jump 
process onto a reduced L-state stochastic jump process; each state of the reduced 
system represents one of the clusters in the original system. Such a mapping is useful 
only if it preserves relevant dynamical properties. In our case this means the reduced 
dynamics ought to provide a good approximation of the low-resolution dynamics of 
the original system. In what follows we show that the structure that we have assumed 
above leads to a separation between fast and slow decay modes of equation The 
latter are then used to set the transition rates of the reduced, L-state process, in such 
a way that this process does, indeed, faithfully reproduce the low-resolution dynamics 
of the original system, to leading order in e. 

For systems with the cluster structure depicted in figure [21 we decompose the 
transition matrix, R: 

Ria\a') = R^"\cT\a') + eR'^^l ia\a'). (5) 

Here R'"' is block-diagonal, with each block corresponding to one cluster [see 
equation (pijl for an example]. The non-diagonal elements of eR'^^ are the weak 
links connecting states belonging to different clusters, while the diagonal elements of 
£R(^) are adjusted so that equation ([3]) holds separately for R'"' and R'^^. We assume 
that the dependence of R on e resides entirely in the factor that multiplies R^^^ in the 
above decomposition. Thus all non-vanishing elements of R^''^ and R*-^-* are assumed 
to be order unity. We then treat eR^^^ as a small perturbation to R(°^ 

To keep considerations simple, let us assume that both R and R*^*^-* have full sets 
of left and right eigenvectors, satisfying bi-orthogonality. Thus for R we have 

Rv„ = A„v„ 
w^R = A^^^l 

• V« = Smn, (6) 
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with m,n = 1,2, - ■ ■ , N. The same relations hold among the eigenvalues and the left 
and right eigenvectors of R^^^ , denoted by An"^ , and Vn^"* . 

The transition matrix R has one vanishing eigenvalue, Ai = 0. All other 
eigenvalues are negative. The corresponding right eigenvector vi is proportional to 
the steady state distribution, and the left eigenvector Wi expresses conservation of 
probability. Since bi-orthogonality does not fix the normalization of these eigenvectors, 
we choose, for convenience, Vi — and = (1, 1, • • • , 1). 

To carry out our perturbation analysis, we expand eigenvectors and eigenvalues 
in powers of e, 

oo oo oo 

v„=5:e'vW , w„ = 5:s'w(0 , A„ = ^e'Am , (7) 
/=o /=o ;=o 

and we match powers of s in equation Q . At leading order we get 

R(")vi")=A(°)v(°), (8) 

and similarly for Wn '' ■ Making use of the block-diagonal structure of R^"\ and 
diagonalizing each block separately, we obtain eigenvectors Vn"'' and Wn'' that inherit 
this structure: each eigenvector has non-vanishing elements only in one block, 
corresponding to one cluster. 

The block-diagonal structure of R*^°-' describes L mutually isolated clusters, each 
of which supports its own steady state and corresponding vanishing eigenvalue. Thus 
R^"^ has L vanishing eigenvalues, A^"^ = for n = 1, • • • , L. The corresponding right 
eigenvectors, Vn \ are composed of blocks of vanishing elements with the steady state 
of the n'th cluster, ^^{n;a) inserted at the appropriate place. (Here a runs over 
states belonging to the n'th cluster.) The corresponding left eigenvectors, v/n \ have 
a similar structure, with the elements '^^{n; a) replaced by I's. The remaining N — L 
eigenvectors of R^°^ have negative eigenvalues of order unity. 

In first-order perturbation theory, the term eR*^^-* in equation ([5]) shifts each 
eigenvalue by an amount of order e. While this represents a relatively small change in 
the non- vanishing eigenvalues A^|^, • • • , A^\ it crucially lifts the degeneracy among 
the first L (vanishing) eigenvalues. These are replaced by a single vanishing eigenvalue 
(corresponding to the steady state of the fully connected system) and L — 1 negative 
eigenvalues of order e, which describe the long time decay of the system. Assuming for 
simplicity that the degeneracy is fully lifted at first order, higher order corrections will 
lead only to small further changes in the eigenvalues and eigenvectors. Therefore, in 
the following we will concentrate on the application of first-order perturbation theory 
to the degenerate sector, n < L. 

Degenerate perturbation theory begins with the L x L matrix 

n(n|m) ^ wW^RWvW , m,n^l,---,L, (9) 

whose eigenvalues represent the first-order corrections, An \ to the vanishing 
eigenvalues of R'^°-' . Because of the correspondence between the clusters of the system 
and the left/right eigenvectors in the above equation, the quantity n(n|rn) represents 
the transition rate between clusters m and n. In the following we emphasize this 
identification by using indices c, c' = 1, • • ■ , L to specify the clusters. 
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The slow decay modes of the system are determined by the matrix 11 whose 
dimensionahty is the number of clusters, L. This matrix satisfies 

L L N N 

c— 1 c— 1 a,(T' — l {7,a' — l 

where we have used X]c=i ^c"'' (c) = 1- In other words 11 satisfies the conservation- 
of-probability condition expressed (for the full system) by equation ([3]). It is thus 
meaningful to interpret the equation, 

dP{c,t) 



dt 



Een(c|c')P(c',t), (11) 



as describing a Markov jump process. We now discuss the physical meaning that can 
be assigned to 11 and P(c, t). 

The separation of time scales between the L — 1 slow and the N — L fast decay 
modes, implies that after the decay of the latter the probability distribution P((t, t) 
satisfies P{a, t) ~ '^^{c] (y)P{c, t). Thus the probabihty inside each cluster is a product 
of its local steady state distribution, ^P'', and the total probability to find the system 
in the cluster, P{c,t) = J^aec^i'^^^)- Equation (fTT|) approximately describes the 
time evolution of the latter. More precisely, equation (fTT|) governs a reduced (L-state) 
Markov jump process, and this reduced process provides an approximate description of 
the transitions between clusters within the original system. Note that the individual 
elements of 11 differ from those of R^^^: for a transition out of a cluster c, with the 
rate e+ appearing in eR^^\ the probability current is e+P{a,t) ~ e+^i^{c;a)P{c,t). 
This contributes a rate e+^^{c; a) in the appropriate place in eTl. 

The procedure outlined above is not the only way to define a reduced system. An 
alternative approach would involve measuring the transition rates between clusters 
in the steady state of the full system, and then constructing a reduced, L-state 
Markov jump process governed by precisely these transition rates. We expect that 
this "empirical construction" - based on the observed cluster-to-cluster transition 
rates - and the "perturbative construction" leading to equation (fTTj) . will give reduced 
systems that become equivalent in the limit e — > 0. 

To summarize, we have used perturbation theory to give quantitative meaning 
to a coarse-graining transformation, in which a Markov jump process among N 
states, arranged into L clusters, is replaced by a reduced Markov jump process 
among L states meant to represent those clusters. In the following section, where 
we consider fluctuations, it is important to bear in mind that the reduced dynamics 
only approximately reproduce the statistics of transitions between clusters of the 
original system. Of course, one might improve the approximation by extending the 
perturbation expansion to higher orders of e; see, e.g. the Appendix of reference [TB]. 
However, higher-order corrections to the eigenvectors of a particular degenerate sector 
typically involve a/Z unperturbed eigenvectors, not only those within that sector. Thus 
it is not generally possible to reduce the dimensionality of the system when considering 
higher orders in e. 



3. Fluctuation relations for systems that can be coarse-grained 



In this section we ask whether the structure described in section[2]leads to a non-trivial 
fluctuation relation. We start by briefly reviewing the fluctuation relation for general 
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stochastic jump processes [HI [T71 [T51 [TT]. We will then turn to systems amenable to 
coarse-graining . 

Fluctuation relations are obtained by comparing probability densities of 
microscopic paths and their time reversed counterparts. A path, or a history, 7(r), 
will denote a stochastic trajectory, represented by an ordered sequence of transitions 
at given times (between say and T), 

7 = 0-0 > 0-1 > 0-2 > • • • > CTn- (12) 

The a's in equation denote the transitions between states. They are redundant 
for the original system, but would be needed after coarse-graining, where several 
transitions between clusters are possible. We have chosen to explicitly denote the 
transitions in order to use similar notation for the original and coarse-grained system. 
The conditional probability density (or weight) corresponding to a path is 

n n 
wh) = ]^exp [i?(cr^|cri)(ti+i -ti)] ]J i?Q^.(crj|crj_i). (13) 

1=0 j=l 

In equation —R{ai\(Ti) denotes the overall rate of probability flow out of the state 
ffi, and we set to = and tn+i = T. The probability density of a path is obtained by 
multiplying w{'y) by the probability to sample its initial conditions, P{af),t — 0). 

To each path we can define a time reversed counterpart 7(i) = 7(T — i), starting 
at CT„ at t = 0, and ending at ctq at t = T. Since the path 7 and its counterpart 
7 reside in the same states for the same lengths of time their exponential survival 
probability factors in equation are identical. As a result 

P(7) _ P(ao,t = 0)w(7) ^ P{ao,t = 0) ^ R^^{aM^-l) 

P(7) P(t7„,i = 0)u.(7) P(a„,i = 0) fjj i?a,(a,_i|(T,)' ^ ' 

Taking the logarithm of both sides yields 

P(7) P((7„,i = 0) ^ P«.(a,_i|fT,) ir{ 

The right side includes a boundary term, B, arising from the initial and final states, 
and a sum of contributions from the n transitions. If we interpret 

^^ = ln§44fi (16) 

as the amount of entropy that is produced when the system makes a transition a from 
ct' to cr [HI E], then 

n 

l^S = ^5S^ (17) 

1=1 

is the net entropy production associated with the trajectory 7, and we have 



Identities of the form of equation ()18|) are at the heart of both steady-state and 
transient fiuctuation relations |llj . The distinction between the two cases, roughly, 
amounts to whether one chooses to consider the limit T ^ 00 (in which case the 
relative contribution of B can be neglected, at least for finite systems |9]), or rather 
to argue that the boundary term B represents the net change in the entropy of the 
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system (in which case the sum on the right represents the entropy produced in the 
system and its surroundings [17]). In either situation the desired fluctuation relation 
follows in a more or less straightforward manner. 

For our purposes the distinction between the transient and steady state cases is 
not of central importance. Rather, we aim to investigate whether a relationship similar 
to equation (fT8| can be obtained for systems of the type studied in section[2l To clarify 
this issue, let us return to the situation introduced briefly in section [2l in which we 
monitor the evolution of our system using a low-resolution apparatus that distinguishes 
between clusters, but not between states within a cluster. Under these circumstances 
we do not observe the full trajectory, 7, but rather a coarse-grained version F (defined 
below) that describes only transitions between clusters. Can we expect this trajectory 
to satisfy a fluctuation relation? In what follows we use the perturbation analysis of 
section [2] to define a coarse-grained entropy production, AS"^*^ , that differs from AS" 
above, and is expressed in terms of the observed transition rates. We argue that this 
quantity approximately satisfies a fluctuation relation, and that the approximation 
improves as the separation of time scales becomes more pronounced (i.e. as e ^ 0). 
The deviations from this (approximate) relation are more pronounced for extreme 
values of AS'*-"^, which select rare trajectories that, in some sense, do not respect the 
separation of time scales. These deviations reflect the non-Markovian nature of the 
observed, coarse-grained dynamics, and can be viewed as evidence of the existence of 
unobserved degrees of freedom. 

The reduced, L-state system defined in section [2] is a stochastic, Markov jump 
process that satisfies an exact fluctuation relation. The entropy produced during a 
transition a from c to c' is given in this case by 

and the net entropy production associated with a trajectory is just a sum of such 
contributions. To simplify notations, we assume here, and in what follows, the 
scenario where different transitions between the same clusters are distinguishable. The 
following considerations can be easily modified to apply to coarse-grained dynamics 
with indistinguishable transitions. 

Let us now return to the full, iV-state system, but assume the conditions of a 
low-resolution apparatus. Suppose the evolution of the system from t = to t = T is 
described by a trajectory 7. To this trajectory let us assign a coarse-grained entropy 
production A5'^'^[7], which is a sum of contributions from jumps between different 
clusters. Specifically, when the system makes a transition a from cluster c' to cluster 
c, AS^^ is updated by an amount 

There are no changes to AS*^*^ arising from (unobserved) jumps within clusters. Thus 
equation (j20p defines a coarse-grained entropy production for the A^-state system, 
under low resolution, by borrowing the definition that arises in the reduced, L-state 
Markov system, equation (fTO|) . We now argue that 

A5C« = E'^^fc ° (21) 

k 

(the sum includes only cluster-to-cluster transitions) satisfies an approximate 
fluctuation relation. 
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Since only the transitions between different clusters are observed under low- 
resolution conditions, let us define a coarse-grained path 

r^{7|co^-i^^ci^-^^...c„,}. (22) 

We interpret F as the collection of all paths 7, of duration T, that start in the cluster 
Co at time 0, make the transition ai to cluster ci at time ri and so forth. Thus 
while the inter-cluster transitions of all trajectories in F are identical, the intra- 
cluster transitions typically differ among these trajectories. Note that we retain 
information on the identity (a) of a transition between two clusters, if only to identify 
the microscopic states involved. Moreover, since AS**-^*^ has the same value for every 
7 e F, we take this value to define the coarse-grained entropy production associated 
with F. 

The probability density assigned to the coarse-grained path F is 

TO = E^(^)- (23) 

This is computed by summing over all possible combinations of transitions inside 
the cluster and integrating over their corresponding transition times, keeping the 
inter-cluster transitions fixed. Each coarse-grained trajectory has a time reversed 
counterpart f , which passes through the clusters in the reversed order, at the 
appropriate times. Let us now compare the statistical weights associated with such a 
conjugate pair of coarse-grained trajectories, F and F. 

The weight of a coarse-grained trajectory is a product of factors (partial weights) 
resulting from the different segments of the microscopic trajectories. A segment here 
refers to the interval of time - bracketed by a pair of transition times, say and Tk+i 
- during which the system remains in exactly one cluster. [E.g. equation specifies 
a path with n' + 1 segments.] 

To compute the partial weight of a given segment, consider the system shown in 
figure m with five states and two outgoing links. The latter represent the decay of the 
probability to remain within this system: when a trajectory makes a transition along 
such a link, it departs the system and does not return. Let ^{a,t\a' ,to) denote the 
conditional probability to find the system at (cr, i), given that it was at {a', to). 

The partial weights discussed above are contributions from trajectory segments 
that perform exactly this type of stochastic dynamics. By specifying F one specifies 
all the transitions between the clusters. Given such a transition, ak, let us denote 
the microscopic states before and after the transition by cr^ and a'^ , respectively. 
The contribution from all trajectory segments entering Ck at {a^ ,Tk) and leaving this 
cluster for the first time at (cr^^^, r^+i), is just /i((T^^j, Tfc+i|cr^, r^). [| 

For the systems studied here, the separation of time scales suggests that the 
system is likely to relax rapidly within a cluster before departing that cluster, which 
in turn suggests a simple approximation: 

M(f^fc+i,T-fc+i|o-A!',rfc) ~ exp [en(cfe,Cfc)(Tfc+i - Tk)]^'{ck;<T^+i). (24) 

Here —ell{ck,ck) is the approximate rate of escape from Ck, assuming the system has 
relaxed within this cluster, while ^^{ck; Cfc+i) is the steady state probability to be at 
^k+i isolated cluster Ck- At this level of approximation, the weight /i does not 

§ Note that each cluster has its own conditional probability distribution fj,. We will not introduce 
new notation to explicitly differentiate between the different functions fi, since the identity of the 
cluster is already specified in the microscopic states that appear as arguments of fi. 




Figure 4. A cluster of states with some additional outgoing transitions. 
Stochastic motion in such a system is related to partial contributions to weights 
of coarse-grained trajectories, see text. 



depend on the initial state . We defer discussion of possible corrections to equation 
((M)) to a later stage. The probability density of a coarse-grained trajectory is then 

n'-l 



V{T) 



P(fTo,i = 0)MK-,Ti|ao,0) 



.Co SCO 



n A*K+i,Tfc+iifT^,Tfe) 



(25) 



k=l 



k=l 



The two factors in square brackets, corresponding to the initial and final clusters, 
account for the fact that the initial and final microstates are not fully specified by F. 
Substituting equation ([24| into equation ([25]) leads to 



n n 

r{T) 2, P(co,t = 0) n exp [eU{ck\ck)irk+i - r^)] [] [£i?W(a+|a;)r (c^; a,")' 



fc=0 



k=\ 



= P(co,t = 0) n exp [en(cfc|cfc)(rfc+i - r^)] [] en„,(cfc|cfc_i), (26) 

fc=0 k=\ 

where IIqj, (cfc|cfc-i) is the contribution of the process at, to the element (cfc,Cfc_i) 
of the reduced transition matrix 11 (see section [2]) . The right side of this equation 
is exactly the weight of a trajectory in the reduced, Markovian system whose time 
evolution obeys equation (fTT|) . In other words, equation ((26|) states that the coarse- 
grained trajectories F are described by approximately the same statistics as those of 
the reduced, L-state Markovian system. Evaluating the ratio of weights for a conjugate 
pair F and f , and applying our definition of AS**-^^ [equation ((2T|) ]. we get 



v{r) p(c„,,t = o) 



exp(A^c«[F]) 



(27) 
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(The factors containing the diagonal elements of 11, appearing in equation (j26p . cancel 
in this ratio.) At this point we have arrived at an analogue of equation psp . the 
crucial relation from which both transient and steady-state fluctuation relations are 
obtained. Our analysis thus reveals that the coarse-grained entropy production AS*^*^ 

- which is defined for the full, A^-state system under conditions of low resolution, but 
which was motivated by a comparison with the reduced, L-state system of section [2] 

- satisfies an approximate fluctuation relation. We will rederive this result, using 
different considerations, in the next section. 

Let us briefly take a closer look at the coarse-grained entropy, equation ([^0]) . 
Comparing the definitions of SS'^'^ and SS, we find 

dS^^{a)^SSia)+ln '^^f'''\l (28) 

Entropy changes in stochastic jump processes are often separated into two 
contributions with different physical interpretations [15] . The first term on the right 
hand side of equation (pS)) expresses the change of the entropy of a thermal medium 
that is in contact with the system [THl [IB El [HI [501 HI] ■ The second term is related 
to changes in the entropy of the system itself, if we interpret —\nP{a{t),t) as the 
entropy of the system at time t along a stochastic trajectory a{t) [T7]. Equation ([^5| 
indicates that coarse-graining does not respect the distinction between the microscopic 
medium and system entropies. Both contribute to changes in the coarse-grained 
entropy. 

Note also that transitions within the clusters contribute to AS* [equation p?| ]. 
but not to AS**-^^ [equation (|2ip ]. In fact, there may be situations where the full, 
A'^-state system relaxes to a steady state characterized by non-zero stationary currents 
within the clusters, but transitions between clusters satisfy detailed balance. This 
would correspond to a state with a non- vanishing rate of entropy production AS", but 
a vanishing rate of coarse-grained entropy production, AS**^^. 

The claims made in previous paragraphs rely on equation (j24p . which is an 
approximation. We should therefore consider possible corrections to equation (|24p 
and their effects on the fluctuation relation. We will use heuristic rather than rigorous 
arguments to estimate these corrections. 

The corrections to equation (j24p depend on the time difference Srk = Tk+i — t^. 
For typical time differences, comparable to the lifetime in the cluster, equation (|24p is 
a good approximation, but we can expect it to have errors of order e. For very short 
(^Tfc ~ 1) and very long (Jr^ Ine/e) time differences we may expect large errors. 
However, the probability of such time differences is itself of order e. Such events are 
not typical. 

The (coarse-grained) steady state fluctuation relation applies to the long time 
behavior of (eT)"i In [P(AS"^^ = qeT)/P{AS'^^ = -qsT)] . The only source of errors 
will then appear in sums over terms of the form In //(u, T|cr', t')/ fi{a' , T — T'\a,T — t). 
The number of terms in the sum is proportional to the number of transitions between 
clusters, which in turn scales as eT. Let us consider flrst the corrections for a typical 
trajectory. The corrections are a combination of typical terms with small errors and 
rare terms with a large error, as discussed in the previous paragraph. The resulting 
errors in estimating the entropy production is small, and scales as e^T. 

The steady state fluctuation relation does not deal only with typical trajectories. 
In fact, one examines trajectories whose entropy production rate is (approximately) 
fixed. This will pick out trajectories which may be very unlikely. We assume that for 
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a finite range of entropy production rates the relevant trajectories are sucli tliat the 
time differences for which the approximation made in equation break down are 
still unlikely (with probability which scales as e). With this assumption, the coarse 
grained-entropy satisfies 

1 . P(AS'CG = eqT) 



lim 



In- 



Oie), 



(29) 



eT P(AS'CG = -eqT) 

for some finite range of q value^. Note that one can not expect this relation to hold for 
any coarse-grained entropy production rate q. Taking this rate to be "large enough" 
will eventually pick trajectories whose typical time differences are of order t/. These 
trajectories will never spend enough time in clusters to equilibrate. 



4. Illustrative Example 

In this section we use a simple example to illustrate fluctuation relations in a coarse- 
grained setting, and to gain some intuition for the admittedly abstract considerations 
of the previous section. We want to use the simplest system possible whose reduced 
counterpart can exhibit out-of-equilibrium steady states. This implies at least three 
clusters of states. 

Consider a system with eight states, = 8, organized into three clusters, L = 3. 
The transition matrix describing the full system is 



(30) 
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(31) 



and 
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(32) 



(The non-vanishing rates are in bold-face.) Numbering the states of the full system 
from 1 to 8, the three clusters are: (1,2,3), (4,5), and (6,7,8). We see that R has 
the structure described in section [2| all off-diagonal elements of R'-'^-' are between 
states within a given cluster, and all off-diagonal elements of R*^^-' are between states 

I The numerical results presented in section |4] hint that the error term in equation II29I I depends on 
g in a non-trivial manner. 
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belonging to different clusters. We begin our analysis of this system by constructing 
the reduced, three-state Markov system. 

The steady state distributions for the (unconnected) clusters appearing in R^"' 
are given by the null vectors of the three matrices along the block diagonal: *p*'(l; cr e 
1) = 5^(25, 13, 18), <P'^(2; cr e 2) i(3, 2) and *P"(3; £ 3) = i(3, 5, 1). From these 
we easily construct the eigenvectors spanning the null space of R'-*'-' . Using equation 
([HI) we then obtain 

(_5 6 ^ \ 

I T -U^ 

This transition matrix defines the dynamics of the three-state reduced system, and 
approximately describes the cluster-to-cluster evolution of the full, eight-state system. 

To evaluate the probability distribution of entropy production rates in the limit 
of large observation time, T, direct simulations are of limited use, as the asymptotic 
probability to observe a specific average entropy production rate decays exponentially 
with T. Thus obtaining good statistics becomes problematic. Following Lebowitz and 
Spohn [9] (see also [H]), we consider the large deviation function 

fiq) = hm 4 In P(A5CG ^ ^^^^ j.^^ (34) 
T^oo el 

and its Legendre transform 5(A). Here q denotes the observed entropy production 
rate, while P(AS'^^) is the probabihty distribution of this entropy production. To 
differentiate between the large deviation function of the reduced and coarse-grained 
system we will denote the former (latter) by a superscript red (CG). 

Lebowitz and Spohn have shown that g{X) is the largest eigenvalue of an operator 
Ti-iX) which governs the time evolution of averages of the type (e"'*'^'^). The diagonal 
elements of Ti. are those of R, while non-diagonal elements are multiplied by a factor 
related to e""**^"^ where the entropy change is that of the transition corresponding to 
this non-diagonal element. We refer for references [9] and [18j for the derivation. 

For the reduced system, whose transition matrix is given by equation (I33p . this 
operator is given by 




5 ( 81 
9 V 140 



H(A)=£ 13 (84)-^ _16 2^ ■ (35) 



14 
9 



This operator has the property 'H{X) — 7i^{l — A), which implies the steady state (or 
Gallavotti-Cohen) fluctuation relation f'^iq) - r°'^{-q) = g [Hi]. Note that the 
dependence of H on e is trivial. To cancel this dependence we define (7™'*(A) to be the 
maximal eigenvalue of H divided by e. 

Let us turn back to the original system. We have defined a coarse-grained entropy 
which changes only during stochastic transitions connecting different clusters, see 
equation ([20|l . One can apply the formalism developed by Lebowitz and Spohn to 
study the distribution function of entropy production for the coarse-grained dynamics. 
The derivation follows closely that of JLSJ . Again, the Legendre transform of the large 
deviation function is given by the largest eigenvalue of an operator (divided by e). 
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The relevant operator is 



n{x) = 



^-3 3 2 \ 

2 -8-4£ 3 2£(||)^ 

1 5 5 £ £{tw)^ 

4£(||)^ -2-2£ 3 

2 -3-5£ 3£2^ 

5£2"^ -4-3£ 2 2 

ef^)^ 3 -3-£ 6 



140 ^ 

VOO 1 1-8/ 



Note that only elements which connect different clusters depend on A. This results 
from the fact that the coarse-grained entropy changes only during these transitions. 
It is important to notice that 7i(A) ^ T-L^{1 — A). This means that the coarse-grained 
entropy does not satisfy an exact fluctuation relation. It does, however satisfy an 
approximate one. 

Consider Ti. for small values of e. When £ = we find that H = R(°). We 
have already studied the eigenvalues and eigenvectors of R*^"-* in section [2l see also 
the discussion leading to equation (l33|) . The largest eigenvalue is with three 
corresponding pairs of left and right eigenvectors. All other eigenvalues are negative 
and of order unity. For small e one can study the eigenvalues of 7i(A) using degenerate 
perturbation theory. The largest eigenvalue must be part of the degenerate sector. 
Moreover, performing the leading order perturbation theory leads to W(A) as the 
matrix whose eigenvalues are the leading order approximation for the eigenvalues of 
'H(A) that belong to the degenerate sector. 

Therefore, the function g'~^'^{X), obtained by computing the largest eigenvalue 
of ([5^ approximately satisfies ^'"''^(A) = 9^*^(1 — A) (with corrections of order 
e). This holds not only for the largest eigenvalue but to all the eigenvalues in the 
degenerate sector. This perturbation theory for the Legendre transform of the large 
deviation function can be considered as another justification for the existence of an 
(approximate) fluctuation relation for the coarse-grained entropy. 

We have calculated numerically the largest eigenvalue of 'H(A), as a function 
of A and compared it to the largest eigenvalue obtained from 7t!(A). The results 
are depicted in figure [5l It is clear that the eigenvalue corresponding to the coarse- 
grained system is a good approximation to the one obtained for the reduced system. 
Moreover, the approximation improves for smaller e. The function g'^°'^{X) for the 
reduced system vanishes at A = and has the exact symmetry 5''®'^ (A) = g'^'^'^{l ~ A) 
as expected. The corresponding functions obtained for the coarse-grained system start 
to deviate from that function for larger values of |A|. This is related to the fact that 
the perturbation theory cannot be uniformly valid. For instance, examining equation 
([M]) . it is clear that, for any e, one can find A* so that, say, 2"^ £ ~ 1. As a result, 
terms which were assumed to be a small perturbation cease to be small. This can be 
loosely interpreted as picking contributions from highly unlikely trajectories, which 
may break our assumption of local relaxation. 

The Legendre transform of the function g{X) is the large deviation function /(g) 
which describes the asymptotic (with time) behavior of the probability distribution of 
the entropy production. The Legendre transform can be computed numerically. For 
both systems studied here the maximum of /(g) was found to be at g ~ 0.08. This is 
the most likely value of the coarse-grained entropy production rate in a long enough 
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Figure 5. The largest eigenvalue of i) ?Tl(A), corresponding to the reduced system 
(solid line), ii) W(A), pertaining to the coarse-grained dynamics, with e = 0.01 
(dashed), and with e = 0.05 (dashed-dotted). (All eigenvalues were divided by e.) 



experiment, in agreement with the entropy production rate calculated for the steady 
state of the reduced system, q'^'^'^ ~ 0.0833. The fact that this value does not vanish 
indicates that the system relaxes to non-equilibrium steady state, in which detailed 
balance is violated. 

To verify that the coarse-grained entropy (approximately) follow the Gallavotti- 
Cohen fluctuation relation we have calculated /"^"^(t?) — f'^'^{—q) and compared it 
to the expected linear behavior. The results are depicted in figure [S] It is clear 
that fluctuation relation is a good approximation for the coarse-grained entropy. 
The approximation improves for smaller values of e, as expected. Note that the 
approximation deteriorates for larger values of entropy production rates. One may 
speculate that this is a result of the increased weight of trajectories that do not spend 
enough time in the clusters to relax to the local stationary state. 

The numerical results, presented in this section, were for a coarse-grained entropy 
defined using a reduced system, which in turn was obtained from the perturbation 
theory of section [2l We have mentioned that there is another way to define a reduced 
system, with the help of the steady state of the microscopic system. The rates of 
this empirically-constructed reduced system differ from the ones used so far by higher 
orders of e. The different rates will result in a different definition of the coarse- 
grained entropy. One may wonder whether this coarse-grained entropy will satisfy the 
fluctuation relation with much smaller deviations. We have repeated the numerical 
calculation performed in this section for this definition of coarse-grained entropy (with 
£ = 0.05, results not shown). The deviation from the fluctuation relation were found 
to be similar to those presented in figured 




Figure 6. Plot of /^^(g) _/CG (.g) 

as a function of the measured coarse-grained 
entropy production rate q. The solid line depicts the prediction of the fluctuation 
relation. 



5. Conclusion 

In this paper, we have studied stochastic jump processes which can be coarse-grained. 
The coarse-grained dynamics can be viewed as the dynamics of the system measured 
using a low resolution apparatus. A perturbation theory, based on a separation of 
time scales, was used to define a reduced system. The dynamics of this reduced 
system, which is Markovian (and therefore satisfies an exact fluctuation relation), 
approximates the coarse-grained system. This reduced system then motivates the 
definition of a coarse-grained entropy for the coarse-grained dynamics. This entropy 
was found to approximately satisfy a fluctuation relation. Deviations from this relation 
are more pronounced for large coarse-grained entropy production rates. This deviation 
can be interpreted as a result of the existence of internal degrees of freedom, which 
lead to deviations from Markovian behavior of the coarse-grained entropy. 

Let us consider an experiment whose goal is to measure fluctuation relations. The 
considerations in this paper may be useful in the interpretation of the results of such 
an experiment. If the results exhibit deviations from the fluctuation relation one may 
suspect the existence of unobserved degrees of freedom. (However, other mechanisms, 
leading to deviations from the expected linear behavior, exist [551.) Note that, even if 
there are internal degrees of freedom, it is entirely possible that the entropy production 
rates, needed for observing deviations from the fluctuation relation, are so rare that 
such events will not be measured during the experiment. 

The results found in this paper lead to several new questions. For instance, a 
better understanding of the deviation of the coarse-grained entropy production from 
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the fluctuation relation is needed. The definition of the coarse-grained entropy used in 

this paper was motivated by the definition apphcd to microscopic systems. One may 
wonder whether it is possible to define another coarse-grained entropy, using only 
quantities measured by the low resolution apparatus, which will exactly satisfy the 
fluctuation relation. It is also of interest to extend the considerations, obtained here for 
stochastic systems, to deterministic systems. Finally, we have seen that the existence 
of a coarse-grained counterpart leads to a physically motivated coarse-grained entropy 
exhibiting a fluctuation relation. One may wonder whether the reverse is also true. 
Does the existence of a quantity, which has some physical interpretation, and satisfies 
a non-trivial fluctuation relation, suggest that the system can be simplified in some 
way? Such questions are left for future work. 
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